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SUMMARY 


A  variety  of  cognitive  /perceptual  abilities  have  been  linked  to  flying  performance.  Two 
tests.  Encoding  Speed  (encoding  and  classification  ability)  and  Immediate /Delayed  Memory  (short¬ 
term  memory  retrieval),  were  administered  to  2,219  United  States  Air  Force  pilot  candidates  prior 
to  entry  Into  Undergraduate  Pilot  Training  (UPT).  Although  both  tests  were  reliable,  only 
performance  on  the  Encoding  Speed  test  was  shown  to  be  related  to  flight  training  performance. 
Pilot  candidates  who  made  quick  and  accurate  responses  on  the  Encoding  Speed  test  were  more 
likely  to  perform  well  on  In-flight  performance  measures  and  be  recommended  for  additional 
training  In  a  fast-jet  (Fighter-Attack -Reconnaissance)  aircraft  upon  completion  of  UPT.  Implica¬ 
tions  for  pilot  selection  and  early  classification  (for  Specialized  Undergraduate  Pilot  Training) 
are  discussed. 
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PREFACE 


This  work  was  completed  under  Work  Unit  77191845  In  support  of  a  Request  for 

Personnel  Research  (RPR  78-11,  Selection  for  Pilot  Training)  submitted  by  training 

program  managers. 

This  paper  is  Intended  to  serve  as  an  interim  report  regarding  two  of  the 

cognitive/perceptual  tests  of  the  Basic  Attributes  Tests  (BAT)  battery. 
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RELATIONSHIP  OF  ENCODING  SPEED  AND  MEMORY  TESTS 
TO  FLIGHT  TRAINING  PERFORMANCE 


I.  INTRODUCTION 


As  aircraft  have  become  Increasingly  sophisticated  with  modern  technological  advances,  many 
functions  previously  performed  by  the  pilot  have  been  automated.  These  advances,  however,  have 
increased  the  demands  placed  on  the  pilot's  cognitive  abilities.  Crucial  to  pilot  performance 
are:  the  speed  and  accuracy  with  which  stimuli  are  perceived,  encoded,  stored,  transformed,  and 
compared;  the  speed  with  which  memory  Is  searched  and  accessed;  and  the  speed  with  which 
decisions  are  made. 

Most  of  the  research  In  the  area  of  cognitive  factors  related  to  pilot  performance  has 
focused  on  pretraining  techniques  to  Improve  comprehension  and  Integration  of  Information  needed 
to  fly  the  aircraft  (Crosby,  1977;  Gerlach,  1974;  Jensen  A  Benel ,  1977).  When  tests  of  cognitive 
abilities  have  been  used  for  pilot  selection  rather  than  for  training,  the  emphasis  has  been  on 
selective  attention  (Gopher  6  Kahneman,  1971)  and  time-sharing  ability  (North  4  Gopher,  1976). 
Recent  test  batteries  designed  to  assess  the  performance  of  aerospace  ground  equipment  crews 
(McLaurln,  1973),  Air  Force  navigator  trainees  (Hunter,  1975),  and  advanced  simulator  trainees  in 
Undergraduate  Pilot  Training  (UPT)  (Pew,  Rollins,  Adams,  4  Gray,  1977)  have  Included  several 
cognitive  tasks  Involving  memory,  spatial  visualization,  verbal  reasoning,  and  other  abilities. 

A  review  of  these  and  other  studies  helped  lead  to  the  development  of  a  computer-acini  nlstered 
test  battery,  the  Basic  Attributes  Tests  (BAT),  designed  to  Improve  the  selection  and 
classification  of  United  States  Air  Force  pilot  and  navigator  trainees.  The  original  BAT  battery 
consisted  of  15  tests;  It  measured  psychomotor  skills,  as  well  as  a  variety  of  cognitive/ 
perceptual  abilities  and  personal Ity/attltudlnal  characteristics  believed  to  be  related  to  pilot 
and  navigator  performance  (see  Carretta,  1987a,  for  a  more  complete  description  of  the  BAT 
battery). 

Several  of  the  cognitive  abilities  tests  in  the  BAT  battery  have  been  evaluated  In  terms  of 
their  ability  to  predict  various  flight  performance  measures  and  final  training  outcome  (see 
Table  1).  These  Include  Digit  Memory  (Information  Input  efficiency),  Decision-Making  Speed 
(choice  reaction  time).  Item  Recognition  (short-term  memory  storage,  search,  and  comparison 
operations).  Mental  Rotation  (spatial  transformation),  and  Time-Sharing  (higher-order  tracking 
ability,  and  learning  rate  and  time-sharing  ability  as  a  function  of  differential  task  load).  Of 
particular  Interest  was  the  potential  of  these  cognitive  tests,  which  rely  heavily  on 
paper-and-pencll  measures,  to  Increase  the  validity  of  current  selection  procedures  used  by  the 
U.S.  Air  Force  to  select  pilot  candidates.  Although  the  experimental  cognitive  tests 
Investigated  did  not  add  significantly  to  the  prediction  of  graduation  or  elimination  from  UPT, 
they  did  demonstrate  significant  relationships  with  several  other  performance  measures  Including 
recocrmendatlons  for  fighter  assignments  after  training  (Carretta,  1987a,  1987b,  1987c).  Pilot 
candidates  who  made  quick,  consistent,  and  accurate  responses  on  these  tests  were  more  likely  to 
perform  better  during  flight  training  and  receive  a  post-training  assignment  for  a  fighter 
aircraft. 

This  paper  examines  the  predictive  utility  of  two  remaining  cognitive  tests  In  the  BAT 
battery;  namely.  Encoding  Speed  (classification)  and  Irmnedlate/Delayed  Memory  (short-term  memory 
retrieval).  The  general  hypotheses  guiding  this  Investigation  were  that  Individual  differences 
In  performance  on  the  tests  would  predict  UPT  performance  and  also  that  use  of  the  tests  in 
combination  with  the  Air  Force  Officer  Qualifying  Test  (AFOQT)  would  add  significantly  to  the 
validity  of  predictions  concerning  flight  training  success  (Rogers,  Roach,  6  Wegner,  1986).  In 
particular.  It  was  expected  that  subjects  with  quicker  reaction  times  and  greater  response 
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accuracy  would  be  more  likely  to  succeed  In  training,  and  that  these  differences  would  be 
reflected  In  better  flight  performance  scores  (check  flight  scores),  which  have  a  broader  range 
than  the  dichotomous  final  training  outcome  measure  ( UPT  pass/fall).  The  fact  that  the  pass/fall 
rate  Is  unevenly  distributed  (80S  pass  versus  20*  fall)  also  makes  that  criterion  less  sensitive. 


Table  1.  Cognitive/Perceptual  Tests  In  the  BAT  Battery 


Test 

Reference 

Cognitive/Perceptual  ability 

Digit  Memory 

Hunter,  1975 

Perceptual  speed.  Information 
Input  efficiency 

Decision-Making 

Speed 

Fleishman  and 
Hempel,  1955 

Choice  reaction  time 

Item  Recognition 

Sternberg,  1966 

Short-term  memory  storage, 
search  and  comparison 

Mental  Rotation 

Shepard  and 

Metzler,  1971 

Spatial  transformation 

Time-Sharing 

North  and 

Gopher,  1976 

Selective  attention 

Encoding  Speed 

Posner  and 

Mitchell,  1967 

Classification  speed 

Immediate/ 

Oelayed  Memory 

Hunter,  1975 

Memory  retrieval 

If  the  scores  from  the  two  cognitive  tests,  taken  together  with  AFOQT  scores,  failed  to 
demonstrate  a  stronger  relationship  with  training  performance  outcomes  than  the  AFOQT  alone, 

there  would  be  no  reason  to  go  to  the  effort  to  modify  or  replace  the  current  test  system. 

Conversely,  If  the  cognitive  tests  were  found  to  add  to  the  validity  of  the  current  test 

procedure,  this  could  be  Interpreted  as  evidence  that  the  cognitive  tests  measure  unique 
abilities  unrelated  to  those  associated  with  the  AFOQT. 

In  addition  to  predicting  successful  completion  of  pilot  training,  the  Air  Force  Is 
Interested  In  classifying  pilots,  as  early  as  possible,  for  advanced  training.  Currently,  UPT 
lasts  49  weeks  and  Includes  about  175  hours  of  flying  time.  Based  on  an  evaluation  by  an 
Advanced  Training  Recomnendatlon  Board  (ATRB)  in  the  42nd  week  of  UPT,  pilot  candidates  are 
recomnended  for  one  of  two  advanced  training  tracks:  Fighter-Attack-Reconnaissance  (FAR) 
aircraft  or  Tanker-Transport-Bomber  (TIB)  aircraft.  In  genera),  the  students  who  perform  best 
during  UPT  are  selected  for  fast-jet  training  ( 1  .e . ,  FAR).  Therefore,  It  was  hypothesized  that 
the  FAR-recommended  pilots  would  demonstrate  quicker,  more  consistent,  and  more  accurate 
performance  on  the  cognitive /perceptual  tests  than  would  the  TTB-recommended  pilots.  The 

demonstration  of  such  a  relationship  would  enable  the  Air  Force  to  Identify  potential  FAR  and  TTB 
candidates  early  In  UPT  and  thus  result  In  more  efficient  and  cost-effective  training. 


II.  METHOD 
Subjects 


The  subjects  In  this  study  were  2,219  Air  Force  officer  candidates  targeted  for  UPT.  They 
were  tested  on  the  Encoding  Speed  and  Iitwedi  ate  /Delayed  Memory  tests  prior  to  their  entry  Into 
UPT.  Only  subjects  who  also  had  scores  on  the  AFOQT  were  Included  In  the  regression  analyses 


that  predicted  performance  on  the  UPT  final  outcome  measure.  Pilot  training  performance  measures 
were  available  for  only  a  portion  of  these  subjects,  as  many  of  them  had  not  yet  completed  UPT. 


Procedure 

Each  subject  was  tested  on  the  AFOOT  prior  to  entry  Into  pilot  training.  This  test  provided 
five  composite  scores  based  on  several  subtests:  Verbal,  Quantitative,  Academic  Aptitude  (verbal 
and  quantitative  combined),  Navigator-Technical,  and  Pilot.  Only  the  Navigator-Technical  and 
Pilot  composite  scores  have  been  used  In  the  operational  selection  of  candidates  for  UPT  (USAF, 
1983).  The  predictive  utility  of  the  AFOQT-PIlot  composite  score  served  as  a  baseline  to  judge 
the  usefulness  of  the  Encoding  Speed  and  Immediate /Delayed  Memory  tests  for  improving  pilot 
selection  and  classification  for  specialized  training,  as  this  composite  has  demonstrated  a 
consistent  relationship  with  pilot  training  performance  (Arth,  1986). 

The  Encoding  Speed  and  Immediate /Delayed  Memory  tests  were  Included  In  the  BAT  battery. 
After  a  test  administrator  Initiated  the  system,  the  test  session  was  self-paced  by  the  subject. 
The  test  session  lasted  about  3  1/2  hours  and  included  scheduled  breaks  between  tests  to  avoid 
problems  with  mental  and  physical  fatigue. 

AFOQT-PIlot  Composite 

The  AFOQT-PIlot  composite  score  Is  based  on  performance  on  8  of  the  16  AFOQT  subtests.  These 
subtests  assess  a  variety  of  skills  Including  verbal  reasoning,  mechanical  and  Instrument 
comprehension,  scale  and  table  reading,  spatial  transformation,  and  general  aviation  knowledge. 

Encoding  Speed 

According  to  Posner  and  Mitchell  (1967),  performance  on  this  task  reflects  verbal 
classification  ability  at  several  levels  of  cognitive  processing. 

In  this  task,  subjects  were  presented  simultaneously  with  a  pair  of  letters  and  required  to 
make  a  same-different  judgment  about  the  letter  pair.  This  judgment  was  based  on  one  of  three 
decision  rules:  Physical  Identity  (AA  versus  Aa),  Name  Identity  (AA  versus  AE ) ,  or  Category 
Identity  (vowels  versus  consonants  -  AE  versus  AH).  The  latency  of  the  encoding  judgment 
provides  a  measure  of  the  speed  of  the  encoding  process.  Latency  differences  among  the  three 
types  of  judgments  Indicate  the  speed  of  recoding.  For  Instance,  the  reaction  time  for  the  Name 
Identity  judgments  minus  reaction  time  for  Physical  Identity  judgments  Is  assumed  to  Indicate  the 
speed  with  which  physical  stimuli  may  be  recoded  to  the  level  at  which  their  name  may  be  accessed. 

The  Encoding  Speed  test  consisted  of  three  32-item  blocks  of  trials,  which  required  about  15 
minutes  to  complete.  Response  time  and  accuracy  were  recorded  on  each  trial. 

Immediate/Delayed  Memory 

This  test  was  designed  to  assess  continuous  short-term  memory  storage  and  retrieval 
operations. 

In  this  task,  the  subject  was  presented  with  a  sequence  of  digits  and  required  to  respond  by 
indicating  the  digit  that  occurred  either  one  or  two  digits  previously.  There  were  two  parts  to 
each  of  these  subtasks.  In  the  first  part,  the  digits  were  presented  for  1/2  second,  followed  by 
a  2-second  Inter-stimulus  interval.  In  the  second  part,  the  inter-stimulus  interval  was  5 
seconds.  Thus,  for  both  the  one-back  and  two-back  subtasks,  part  one  dealt  with  "immediate" 
memory,  and  part  two  with  "delayed"  memory. 


There  were  25  trials  In  each  level  of  subtask  (one-  versus  two-back)  by  length  of  latency  (2 
versus  5  seconds)  condition,  resulting  In  100  trials.  As  with  Encoding  Speed,  response  time  and 
accuracy  were  recorded  on  each  trial.  The  Iimedlate/Delayed  Memory  test  required  about  25 
minutes  for  completion. 


UPT  Performance  Criteria 


UPT  final  outcome  was  assigned  at  the  completion  of  UPT  and  was  recorded  as  a  dichotomous 
variable  (fall  *  0  and  pass  5  1).  Subjects  who  satisfactorily  completed  UPT  received  a 
recomnendatlon  for  follow-on  training  In  either  a  fast-jet  (FAR)  or  a  slower  aircraft  (TTB)  by  an 
Advanced  Training  Recomnendatlon  Board  (ATRB)  consisting  of  T-38  Instructor  Pilots  (TTB  *  0  and 
FAR  *  1).  Generally,  FAR  aircraft  are  considered  to  be  more  demanding  than  TTB  aircraft. 

UPT  final  outcome  and  ATRB  recomnendatlon  were  determined.  In  part,  by  a  subject's 
performance  on  six  check  flights  aurlng  UPT.  A  check  flight  Involved  an  In-flight  performance 
evaluation  by  an  Instructor  Pilot.  The  first  three  check  flights  took  place  in  a  T-37,  low- 
performance  jet  trainer.  Three  later  flights  took  place  In  a  T-38,  higher-performance  supersonic 
jet  trainer.  The  T-37  check  flights  Included:  midphase  contact,  a  subject's  first  check  flight; 
contact.  In  which  the  subject's  ability  to  perform  maneuvers  and  aerobatics  by  visual  cues  from 
outside  the  plane  was  evaluated;  and  instrument.  In  which  the  subject  was  required  to  perform 
maneuvers  by  reference  to  the  displays  on  cockpit  Instruments.  The  T-38  check  flights.  In 
addition  to  contact  and  Instrument,  Included  evaluation  of  the  subject's  ability  to  fly  in 
formation  with  other  aircraft.  Each  subject  received  an  overall  check  flight  grade  (1 -unsatis¬ 
factory,  2-falr,  3-good,  or  4-excellent)  and  a  percentage  score  (based  on  performance  of  certain 
maneuvers  within  the  flight)  for  all  check  flights  completed  during  training.  Table  2  provides 
a  sumary  of  the  number  of  subjects  who  had  scores  on  the  two  BAT  tests  and  the  UPT  performance 
criteria. 


Table  2.  Humber  of  Subjects 


Tests 

Test 

only 

UPT 

ATRB 

Check 

flights 

Encoding  Speed 

2,219 

930 

625 

184 

Immediate /Delayed 

867 

567 

390 

132 

Memory 

Both  Tests 

867 

545 

376 

118 

III.  RESULTS  AND  DISCUSSION 
Encoding  Speed 


Descriptive  Measures 

Table  3  summarizes  the  average  response  times  and  percent  correct  scores  for  the  same  and 
different  judgments  for  each  of  the  three  types  of  encoding  decisions.  Accuracy  of  response  was 
high  for  all  parts  of  this  test  (85. 7t  to  95. 3i  correct).  This  was  considered  encouraging.  In 
keeping  with  comnon  practice  regarding  tasks  of  this  nature,  calculation  of  average  response 
times  was  based  only  on  those  trials  with  correct  responses. 
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Table  3.  Encoding  Speed:  Average  Response  Time  and  Percent  Correct 


Condition 

Number 
of  trials 

Response  time 
Mean 

(■S) 

SO 

i  Correct 

Physical  Identity 

same 

16 

662.9 

196.1 

95.3 

di fferent 

16 

758.8 

[710.9] 

200.9 

94.5 

[94.9] 

Name  Identity 

same 

16 

695.0 

162.7 

92.3 

di fferent 

16 

741.1 

[716.1] 

176.4 

94.5 
1.93. 4] 

Category  Identity 

same 

16 

925.1 

248.8 

85.7 

di  fferent 

16 

940.5 

[932.8] 

236.1 

92.7 

169.2] 

Note.  N  =  2,219. 


Average  response  time  was  slightly  lower  for  the  32  Physical  Identity  trials  (mean  *  710.9 
ms.)  than  for  the  Name  Identity  trials  (mean  =  718.1  ms.),  but  was  considerably  longer  for  the 
Category  Identity  trials  (mean  =  932.8  ms.).  Although  the  order  of  the  response  time  means  was 
consistent  with  Posner  and  MitchelTs  (1967)  "depth  of  processing"  model,  the  difference  in 
average  response  times  between  Physical  Identity  and  Name  Identity  trials  was  considerably  less 
than  that  reported  by  Posner  and  Mitchell  (1967)  (10  ms.  Instead  of  50  ms.).  Contrary  to  their 

depth  of  processing  model,  the  average  response  time  for  correct  "different"  judgment  trials  was 

somewhat  longer  for  Physical  Identity  trials  (mean  =  756.6  ms.)  than  for  Name  Identity  trials 
(mean  *  741.4  ms.).  These  results  are  depicted  in  Figure  1. 

For  the  present  investigation,  Cronbach's  reliability  coefficient  indicated  that  the  96  items 

(alpha  s  .711)  and  their  response  times  (alpha  *  .958)  were  reliable. 


Factor  Structure 


As  the  pattern  of  response  times  for  these  subjects  did  not  fit  the  depth  of  processing 
model,  differences  in  response  times  between  types  of  judgment  were  not  calculated.  Average 
response  time  for  correct  responses,  percent  correct,  and  a  response  time  by  percent  correct 
interaction  term  were  calculated  for  the  Physical,  Name,  ano  Category  Identity  trials  separately 
and  used  in  the  factor  analysis.  These  variables  were  chosen  to  reflect  three  important  aspects 
of  judgment:  speed,  accuracy,  and  speed-accuracy  tradeoffs  for  differing  levels  of  judgment. 

As  indicated  in  Table  4,  the  strongest  inter-item  correlations  occurred  between  variables  of 
the  same  type.  The  response  time  measures  were  related  strongly  to  each  other  (.567  1  rl  .  736) 
but  only  weakly  to  the  accuracy  scores  (.200  £_r  £.215)  and  Interaction  terms  (-.175  HI. 
-.017).  The  accuracy  scores  were  related  moderately  to  each  other  (.266  <  _r  £  .457),  but  were 
not  related  to  the  interaction  terms  (-.076  £_r  £  .043).  The  interaction  terms  were  not  related 
to  each  other  (-.02  8  <  r  <  .099). 


Table  4.  Encoding  Speed:  Inter-Item  Correlation  Matrix 


Variable 

1 

2 

3 

4 

5 

6 

7 

8 

9 

1. 

Average  RT  -  Physical 

1.000 

2. 

Average  RT  -  Name 

.633 

1.000 

3. 

Average  RT  -  Category 

.587 

.738 

1.000 

4. 

%  Correct  -  Physical 

.208 

.198 

.194 

1.000 

5. 

*  Correct  -  Name 

.156 

.215 

.179 

.296 

1.000 

6. 

%  Correct  -  Category 

.106 

.298 

.200 

.286 

.457 

1.000 

7. 

RT  by  %  -  Physical 

-.029 

.032 

.003 

-.076 

-.069 

-.085  1 

.000 

8. 

RT  by  *  -  Name 

.064 

-.017 

.060 

-.025 

.043 

-.101 

.099 

1.000 

9. 

RT  by  *  -  Category 

-.124 

-.160 

-.175 

-.015 

-.078 

-.031 

.028 

-.001 

1.000 

Note.  Variable  labels 

refer 

to  the 

average 

response  time, 

percent 

correct,  and 

average 

response  time  by  percent 

correct  interaction 

term 

for  the 

Physical , 

Name, 

and  Category 

Identity  conditions,  respectively. 

N  =  2,219. 

The  factor  analysis  yielded 

three 

factors 

that 

accounted 

for  57. 

7*  of 

the  total  item 

variance.  After  rotation, 

the  principal 

factor. 

which 

consisted  of  the 

three 

average 

response 

times,  accounted  for  65.4*  of  the  explained  variance.  The  three 

accuracy 

scores 

loaded 

on  Factor 

2. 

and  the  three  interaction  terms 

loaded  primarily  on 

Factor  3. 

Results 

of  the 

factor 

analysis 

are  shown  In  Table  5. 


Table  5.  Encoding  Speed:  Sunaary  of  Factor  Analysis 


Variable 

Commonality 

Factor 

1  Factor  2 

Factor  3 

1.  Average  RT 

-  Physical 

.509 

.702 

.104 

.074 

2.  Average  RT 

-  Name 

.816 

.879 

.187 

-.088 

3.  Average  RT 

-  Category 

.688 

.816 

.139 

.050 

4.  *  Correct 

-  Physical 

.187 

.174 

.392 

-.061 

5.  %  Correct 

-  Name 

.539 

.101 

.724 

.065 

6.  *  Correct 

-  Category 

.472 

.145 

.631 

-.229 

7.  RT  by  %  - 

Physical 

.037 

.025 

-.109 

.157 

8.  RT  by  *  - 

Name 

.274 

.034 

.010 

.522 

9.  RT  by  *  - 

Category 

.036 

-.183 

-.038 

-.029 

%  of  Explained 

Factor  Eigenvalue 

variance 

Cumulative  * 

1 

2.33 

65.4 

65.4 

2 

.89 

25.0 

90.4 

3 

.34 

9.6 

100.0 

Mote.  N  »  2,219. 


These  results  suggested  that  the  most  conceptually  Important  measure  provided  by  this  test 
was  general  response  latency.  The  accuracy  of  response  and  speed  by  accuracy  tradeoff  variables 
were  less  Important;  however,  they  did  make  unique  contributions  In  terms  of  explaining 
performance  on  this  test.  Based  on  these  results,  three  variables  were  retained  for  the  flight 
training  regression  analyses:  overall  response  time,  percent  correct,  and  a  response  time  by 
percent  correct  Interaction  term. 


To  evaluate  the  usefulness  of  the  cognitive  tests  for  predicting  flight  training  performance. 
It  was  necessary  to  determine  the  extent  to  which  the  current  selection  instrument  (l.e., 
AFOQT-Pilot  composite  score)  was  related  to  performance.  The  AFOQT-Pilot  composite  score 
demonstrated  a  modest  but  significant  relationship  only  with  UPT  final  outcome  (_r  =  .090,  £  < 
.05)  and  was  related  marginally  to  advanced  training  assignment  (_r  *  .099,  p  £  .10).  The 
AFOQT-PIlot  composite  score  was  not  related  to  check  flight  performance.  Results  of  the 
regression  analyses  are  summarized  in  Table  6. 

Table  6.  AFOQT-PIlot  Composite  Score:  Summary  of 
UPT  Outcome  Regression  Analyses 


As  can  be  seen  In  Table  7,  the  Encoding  Speed  model  was  not  related  significantly  to  UPT 
final  outcome  (R_  *  .101,  n.s.)  but  was  related  to  ATRB  rating  (R_  «  .165,  ££  .025).  It  should  be 
noted  that  average  response  latency  was  related  differentially  to  final  training  outcome  and 
advanced  training  recommendation.  Subjects  who  made  quick  responses  were  less  likely  to  complete 
UPT.  However,  of  those  who  successfully  completed  UPT,  those  who  made  quick  responses  were  more 
likely  to  be  recoamiended  for  advanced  training  in  fast  jets  (FAR  aircraft).  One  explanation  for 
these  results  Is  that  during  the  early  part  of  training  (T-37  phase),  cautious  actions  (slower 
responses)  are  considered  desirable.  As  a  result,  some  “reckless"  quick-responders  are 
eliminated.  During  the  later  phases  of  training  when  faster  aircraft  are  used  (T-38  phase),  the 
ability  to  respond  quickly  becomes  an  Important  asset.  This  explanation  Is  supported  by  the 
pattern  of  correlations  between  average  response  time  and  T-38  check  flight  performance  scores. 
Average  response  time  Is  negatively  correlated  with  performance  for  each  of  the  six  T-38  check 
flight  grades  and  percentage  scores.  Performance  during  the  T-38  phase.  In  turn,  has  a  direct 
Impact  on  the  advanced  training  assignment.  The  Encoding  Speed  model  was  related  significantly 
to  performance  on  two  of  the  three  T-37  check  flight  grades:  contact  (_R  *  .274,  £  <  .05)  and 
Instrument  (R  *  .262,  p  <  .05).  It  was  not  related  to  T-38  performance. 


Encoding  Speed:  Suatary  of  UPT  Outcome  Regression  Analyses 


the  combined  model  was  judged  to  be  significant.  Similarly,  the  components  of  the  Encoding  Speed  model 
tested  only  when  the  Encoding  Speed  model  and/or  the  combined  model  was  judged  to  be  significant.  The 
in  parentheses  indicates  the  significance  level  of  the  combined  model  when  it  was  marginally  significant. 


A  combined  model  that  used  the  AFOQT-PIlot  composite  score  and  the  three  Encoding  Speed 
scores  was  related  significantly  to  UPT  final  outcome  (R_  *  .156,  p^  <_  .01),  ATRB  rating  (R_  *  .176, 
p  _<  .025),  and  the  T -37  Instrument  flight  grade  (_R_  *  .310,  _p_  S.  and  was  relited  marginally  to 
the  T -37  contact  flight  grade  (£  *  .275,  p_  <  .10).  In  each  of  these  Instances,  the  combined 
model  significantly  improved  prediction  of  performance  beyond  the  level  of  prediction  provided  by 
the  AFOQT-PIlot  composite  score  alone.  These  results  suggested  that  the  Encoding  Speed  test 
measured  some  flight  training  performance-rel ated  ability  not  captured  by  the  AFOQT-PIlot 
composite  score.  The  combined  model  multiple  correlation  Increases  are  summarized  in  the  last 
column  of  Table  7. 


edl  ate /Delayed  Memory 


Descriptive  Measures 


Percent  correct  and  average  response  time  for  correct  responses  for  each  of  the  four  parts 
are  summarized  In  Table  8.  Accuracy  of  response  was  high  for  the  first,  second,  and  fourth  parts 
of  this  test  (averaging  91. 51  correct),  but  was  rather  low  for  part  three  (68.7%  correct).  This 
may  have  occurred  because  at  part  three  the  task  changed,  requiring  subjects  to  remember  a  digit 
that  was  presented  two  digits  back  (delayed  memory)  rather  than  one  digit  back.  Some  of  the 
subjects  may  have  been  confused  by  the  instructions  for  part  3  or  may  have  found  the  changed  task 
more  difficult.  Whatever  the  reason  for  the  decrement  In  performance,  the  subjects  recovered 
during  part  4  (delayed  memory  -  5-second  delay).  Despite  this  problem,  responses  on  this  test 
were  very  reliable  (alpha  ■  .938). 

Table  8.  I«ed1ate/Del«yed  Memory: 

Average  Response  Time  and  Percent  Correct 


Number 

Response  T1m  (ms.) 

Condition 

of  trials 

Mean  SD 

%  Correct 

257.0 

89 

217.9 

94 

308.2 

68 

252.8 

90 

One  Digit  Back 
2-second  delay 
5-second  delay 

Two  Digits  Back 
2-second  delay 
5-second  delay 


Note.  N  *  867. 

Average  response  time  for  correct  responses  was  fairly  consistent  across  the  four  parts  of 
this  test  (496  ms.,  457  ms.,  551  ms.,  and  494  ms.,  respectively)  and  was  very  reliable  over  the 
100  trials  (alpha  ■  .943). 

Results  from  the  accuracy  and  response  time  measures  suggested  that  there  was  an  accuracy  by 
response  time  interaction  due  primarily  to  the  part  3  trials.  Response  time  was  longer  and 
accuracy  was  lower  during  part  3. 


Factor  Structure 

The  most  conceptually  important  measures  from  this  test  were  average  response  time  and 
percent  correct  for  each  of  the  four  parts,  as  these  measures  provided  Information  regarding 
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Individual  differences  In  the  speed  and  accuracy  of  Immediate  and  delayed  memory  operations. 
Speed  by  accuracy  Interaction  terms  for  the  four  parts  were  Included  In  the  factor  analysis,  as 
speed/accuracy  tradeoffs  were  also  considered  Important. 

The  Inter-Item  correlation  matrix,  presented  In  Table  9,  Indicated  that  the  four  average 
response  time  measures  were  related  positively  to  each  other  (.287  _<  _r  _<  .553)  and  negatively  to 
their  respective  percent  correct  scores  (-.459  <  _r  <  -.285).  These  results  provided  additional 
evidence  of  a  speed/accuracy  tradeoff,  as  subjects  with  quicker  response  times  tended  to  be  less 
accurate.  The  four  percent  correct  scores  were  correlated  moderately  with  each  other  (.144  <  r_  < 
.397).  The  Interaction  terms  were  not  related  to  each  other  (-.027  £_r  <  .144),  but  were  related 
to  the  response  time  and  percent  correct  scores  that  contributed  to  the  Interaction  term. 

The  factor  analysis  yielded  five  factors  which  accounted  for  73.7%  of  the  total  Item 
variance.  Factors  1,  2,  4,  and  5  consisted  of  the  average  response  time,  percent  correct,  and 
the  response  time  by  percent  correct  Interaction  term  for  each  of  the  four  stimulus  conditions 
(1-  versus  2-back  by  2-  versus  5-second  delay).  Factor  3  can  be  thought  of  as  a  general 
“response  latency"  factor,  as  the  four  response  times  loaded  on  It.  A  sumnary  of  the  final 
factor  solution  Is  provided  In  Table  10. 

These  results  suggested  that  although  there  was  a  coimton  general  response  latency  component 
for  the  four  parts  of  this  test  (Factor  3),  the  task  demands  were  unique,  In  part,  for  each  of 
the  four  parts  of  this  test  (as  Indicated  by  Factors  1,  2,  4,  and  5).  As  a  result,  all  12 
variables  were  retained  for  use  In  the  prediction  of  flight  training  performance. 


Inferential  Measures 


The  Imnedlate/Delayed  Memory  model,  with  all  12  variables  from  the  factor  analysis, 
demonstrated  poor  predictive  utility  against  all  of  the  performance  criteria.  The  model  was  not 
related  to  UPT  pass/fall  outcome  (_R  ■  .172,  n.s.),  advanced  training  recommendation  (TTB/FAR)  (R^ 
*  .213,  n.s.),  or  check  flight  performance.  The  zero-order  correlations  between  the  12  variables 
in  the  Imnedlate/Delayed  Memory  model  and  the  various  performance  criteria  were  not  consistently 
In  the  same  direction.  This  lack  of  stability  was  not  surprising  as  very  few  of  these  zero-order 
correlations  were  significant  at  the  .05  level  of  probability.  A  summary  of  these  regression 
analyses  Is  provided  In  Table  11. 

As  can  be  seen  from  Table  11,  In  general,  neither  the  AFOQT-PIlot  composite  score  nor  the 
Imnedlate/Delayed  Memory  model  was  related  closely  to  flight  training  performance.  Subjects  with 
higher  AFOQT-PIlot  composite  scores  were  more  likely  to  successfully  complete  UPT  (_r  *  .090,  p  £ 
.05),  whereas  the  Imnedlate/Delayed  Memory  model  was  not  related  significantly  to  any  of  the 
training  performance  measures. 

A  combined  model  was  tested  to  determine  whether  the  AF0QT-P1 lot  composite  score  and  the 
Imnedlate/Delayed  Memory  model,  taken  together,  could  Improve  prediction  of  flight  training 
performance.  As  can  be  seen  from  the  last  two  columns  of  Table  11,  the  combined  model  was  only 
marginally  related  to  one  of  the  performance  measures:  UPT  pass/fall  (R^  •  .193,  p  <  .10).  The 
Imnedlate/Delayed  Memory  model  did  not  add  significantly  to  the  predictive  utility  provided  by 
the  AFOQT-PIlot  composite  alone. 


Full  Model 

When  considered  separately,  neither  the  AFOQT-PIlot  composite  score  nor  the  BAT  tests 
demonstrated  a  close,  consistent  relationship  with  all  three  performance  criteria.  The 
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10.  Uaedl at*/ Delayed  Meaory: 


AFOOT-PI  lot  composite  score  was  related  to  UPT  final  outcome,  but  not  to  advanced  training 
assignment  or  check  flight  performance.  In  contrast,  the  Encoding  Speed  model  was  not  related  to 
UPT  final  outcome,  but  was  related  to  advanced  training  assignment  and  performance  on  two  of  the 
three  T-37  check  flights.  Subjects  who  made  quick  responses  were  more  likely  to  be  recommended 
for  advanced  training  In  fast-jet  aircraft.  Scores  from  the  Irmedlate/Del  ayed  Memory  model  were 
not  related  at  all  to  pilot  training  performance. 

As  the  AFOQT-Pllot  composite.  Encoding  Speed  model,  and  Immediate/Delayed  Memory  model 
demonstrated  different  patterns  of  relationships  with  the  flight  training  performance  measures, 
it  was  felt  that  each  of  these  tests  may  have  been  measuring  some  unique  ability.  If  this  was 
so,  prediction  of  performance  might  be  Improved  by  using  measures  from  more  than  one  source  in  an 
integrated  model. 

Each  combined  model  was  evaluated  to  determine  whether  scores  from  the  experimental  tests 
(Encoding  Speed  and  Immediate/Delayed  Memory)  improved  prediction  of  flight  training  performance 
beyond  the  level  of  prediction  provided  by  the  currently  used  selection  measure  (AFOQT-Pllot 
composite  score).  A  combined  model  that  used  the  AFOQT-Pllot  composite  along  with  the  Encoding 
Speed  measures  significantly  improved  the  prediction  of  UPT  final  outcome,  advanced  training 
recommendation,  and  two  of  the  three  T-37  check  flight  grades  (see  Table  7).  The  Immediate/ 
Delayed  Memory  measures  used  In  combination  with  the  AFOQT-Pllot  composite,  however,  did  not 
improve  prediction  of  flight  training  performance  (see  Table  11). 

A  “full  model"  that  used  the  AFOQT-Pilot  composite  score  along  with  both  the  Encoding  Speed 
and  Immediate/Delayed  memory  measures  was  related  significantly  only  to  UPT  final  outcome  (R_  * 
.230,  p  .05).  The  full  model  did  predict  final  outcome  better  than  the  AFOQT-Pllot  composite 
alone  (_r  *  .090)  (F[15,529]  *  1.67,  £  .05)  but  did  not  differ  significantly  from  a  reduced 
model  consisting  of  the  AFOQT-Pllot  composite  and  the  Encoding  Speed  measures  only  (R^  *  .156) 
(_F[1 2,532]  1  1.34,  n.s.).  The  full  model  regression  analyses  are  summarized  in  Table  12. 

A  comparison  of  the  various  combined  models  suggested  that  the  Encoding  Speed  model  was  able 
to  improve  flight  training  performance  prediction  beyond  that  provided  by  the  AFOQT-Pllot 
composite,  whereas  the  Immedfate/Delayed  Memory  model  was  not.  Based  on  these  results,  the 
Inmedlate/Delayed  Memory  test  should  probably  not  be  retained  In  the  BAT  battery. 


IV.  CONCLUSION 

Performance  measures  from  both  cognitive  tests  were  sufficiently  reliable  to  be  used  for 
selection  purposes;  however,  only  scores  from  the  Encoding  Speed  test  were  related  to  flight 
training  performance.  Those  from  the  Inmedlate/Delayed  Memory  test  were  not. 

Encoding  Speed  scores,  when  used  In  combination  with  a  currently  used  selection  instrument 
(AFOQT-Pllot  composite  score),  improved  the  prediction  of  several  flight  training  performance 
criteria,  including  successful  completion  of  training,  advanced  training  recommendation,  and 
check  flight  performance.  It  should  be  noted,  however,  that  average  response  time  on  the 
Encoding  Speed  test  was  related  differentially  to  final  training  outcome  versus  T-38  performance 
and  advanced  training  recommendation.  Slower,  moie  cautious  subjects  were  more  likely  to 
complete  training  successfully.  Among  those  who  graduated  from  UPT,  however,  those  who  made 
quick  and  accurate  responses  on  the  Encoding  Speed  test  were  more  likely  to  perform  well  on  the 
T-38  check  flights  and  receive  a  post-UPT  recommendation  for  advanced  training  in  a  FAR 
aircraft.  The  latter  relationship  Is  consistent  with  results  from  other  cognitive  tests  In  the 
BAT  battery  (Decision-Making  Speed,  Item  Recognition,  Mental  Rotation,  Time-Sharing;  see 
Carretta,  1987a,  1987b,  1987c).  These  cognitive  tests  may  be  most  useful  when  It  Is  desirable 
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to  make  specialized  track  assignments  early  in  training  or  when  only  fighter-recoemended  (FAR)  or 
non-fighter-recommended  (TTB)  pilots  are  needed  (Euro-NATO  Joint  Jet  Pilot  Training  or  Air 
National  Guard  units). 

The  next  step  In  the  evaluation  of  the  seven  BAT  cognitive/perceptual  tests  will  be  to 
determine  whether  they  make  unique  contributions  to  the  prediction  of  flight  training  performance 
when  considered  together.  If  they  are  redundant,  some  of  them  may  be  removed  from  the  BAT 
battery  and  replaced  by  tests  of  other  abilities. 
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